Detection of isoniazid resistant strains of M. tuberculosis

ABSTRACT

A method for determining the susceptibility of a strain of M. tuberculosis to isoniazid is provided comprising employing the techniques of restriction fragment length polymorphism analysis to determine whether or not the DNA of said strain has an MspI restriction site at the codon corresponding to codons 315 or 463 of an M. tuberculosis katG gene consensus sequence.

This is a continuation-in-part application of U.S. patent application Ser. No. 08/228,662 filed Apr. 18, 1994, which is incorporated herein by reference.

BACKGROUND OF THE INVENTION

Despite more than a century of research since the discovery of Mycobacterium tuberculosis, the aetiological agent of tuberculosis, this disease remains one of the major causes of human morbidity and mortality. There are an estimated 3 million deaths annually attributable to tuberculosis (see, D. Snider, Rev. Inf. Dis., S335 (1989)), and although the majority of these are in developing countries, the disease is assuming renewed importance in the West due to the increasing number of homeless people and the impact the AIDS epidemic (see, R. E. Chaisson et at., Am. Res. Resp. Dis., 23, 56 (1987); D. E. Snider, Jr. et al., New Engl. J. Med., 326, 703 (1992); M. A. Fischl et al., Ann. Int. Med., 117, 177 (1992) and ibid. at 184.

Isonicotinic acid hydrazide or isoniazid (INH) has been used in the treatment of tuberculosis for the last forty years due to its exquisite potency against the members of the "tuberculosis" groups--Mycobacterium tuberculosis, M. bovis and M. africanum (G. Middlebrook, Am. Rev. Tuberc., 69, 471(1952) and J. Youatt, Am. Rev. Resp. Dis., 99, 729 (1969)). Neither the precise target of the drug, nor its made of action are known, but INH treatment results in the perturbation of several metabolic pathways of the bacterium. However, shortly after its introduction, INH-resistant isolates of Mycobacterium tuberculosis emerged. See M. L. Pearson et al., Ann. Int. Med., 117, 191 (1992) and S. W. Dooley et al., Ann. Int. Med., 117, 257 (1992).

Several investigators have associated the toxicity of INH for mycobacteria with endogenous catalase activity. See, for example, "Isonicotinic acid hydrazide," in F. E. Hahn, Mechanism of Action of Antibacterial Agents, Springer-Verlag (1979) at pages 98-119. This relationship was strengthened by a recent report by Ying Zhang and colleagues in Nature, 358, 591 (1992) which described the restoration of INH susceptibility in an INH resistant Mycobacterium smegmatis strain after transformation using the catalase-peroxide (katG) gene from an INH sensitive M. tuberculosis strain. In a follow-up study, Zhang and colleagues in Molec. Microbiol., 8, 521 (1993) demonstrated the restoration of INH susceptibility in INH resistant M. tuberculosis strains after transformation by the functional katG gene. As reported by B. Heym et al., J. Bacteriol., 175, 4255 (1993), the katG gene encodes for a 80,000-dalton protein.

A significant problem in control of multiple drug resistant tuberculosis (MDR-TB) epidemics has been the delay in the detection of drug resistance. Currently laboratory methods used to identify M. tuberculosis drug resistance require weeks to months for results. The rapid detection of M. tuberculosis directly from clinical samples has been possible recently by virtue of the availability of polymerase chain reaction (PCR) and the recognition of diagnostic sequences amplified by the appropriate primers. The ability to conduct PCR analyses depends on having a high enough gene or gene product concentration so that the molecular tools work efficiently even when the organism numbers are low. Thus, the most efficient molecular assays used to detect M. tuberculosis depend on the IS6110 insertion sequence (about 10 copies) or the 16S ribosomal RNA (thousands of copies). See, respectively, K. D. Eisenach et al., J. Infect. Dis., 61,997 (1990) and N. Miller et al., Abstracts ASM, Atlanta, Ga. (1993) at page 177. Recently, B. Heym et al., (PCT WO 93122454) disclose the use of polymerase chain reaction to amplify portions of the katG gene of putative resistant strains. The PCR products were evaluated by single-strand conformation polymorphism (SSCP) analysis, wherein abnormal strand motility on a gel is associated with mutational events in the gene. For example, in five strains, a single base difference was found in a 200 bp sequence, a G to T transversion at position 3360. This difference would result in the substitution of Arg-461 by Leu. However, carrying out SSCP on a given clinical sample can be a laborious procedure that requires sequencing to confirm whether mutations or deletions predictive of drug resistance are in fact present in the target gene.

There is a continuing need in the art to develop a simple test permitting the rapid identification of INH-resistant strains of M. tuberculosis.

SUMMARY OF THE INVENTION

The present invention provides a method to rapidly identify strains of M. tuberculosis which are resistant to isoniazid (INH). The method is based on our discovery that certain mutations in the katG gene of M. tuberculosis which confer INH resistance coincidentally result in the addition or deletion of restriction sites, which are recognized by various restriction enzymes. For example, the wild-type (WT) katG gene of M. tuberculosis contains an NciI-MspI restriction site spanning codon 463, which site is absent in the corresponding codon 463 in a number of INH resistant strains due to a single base mutation in that codon. An NciI-MspI restriction site is a site cleaved by both NciI and MspI. This site is represented by the nucleotide sequence CCGGG (see Table 1).

Alternatively, or in addition, some INH resistant strains have a single base mutation in codon 315 in the katG gene that produces a new MspI restriction site associated with the corresponding codon 315 in the INH resistant strain. Some INH resistant strains have a single base mutation at codon 337 that results in the deletion of a RsaI restriction site otherwise present at the corresponding position in the WT gene; and some have a single base mutation at codon 264 that eliminates a CfoI restriction site. These mutations may be present singly or in combination in INH resistant M. tuberculosis strains.

When used in reference to nucleotide position, codon position or restriction site position, the term "corresponding" is defined to mean the same absolute location on two different M. tuberculosis katG genes, wherein absolute location is defined by the numbering system used in FIG. 7 (SEQ ID NO: 20). For example, a wild-type codon 463 represented by CGG at nucleotide positions 1456-1458 on a wild-type katG gene of M. tuberculosis and a mutant codon 463 represented by CTG at the same nucleotide positions 1456-1458 on a katG gene of an INH resistant strain of M. tuberculosis are considered to be corresponding codons.

The determination of whether one or more of these identifying mutations in the katG gene are present in a strain of M. tuberculosis can be made by employing the techniques of restriction fragment length polymorphism (RFLP) analysis. Therefore, in an embodiment directed to the identification of a mutation in codon 463 that is associated with INH resistance, the present assay comprises the steps of:

(a) amplifying a portion of the katG gene of an M. tuberculosis isolate to yield a detectable amount of DNA comprising the nucleotide position occupied by base 1457 of the M. tuberculosis katG gene consensus sequence depicted in

FIG. 7 (SEQ ID NO: 20); and

(b) determining whether an NciI-MspI restriction site is absent

in codon 463 of said katG gene, wherein said absence is

indicative of an INH resistant strain of M. tuberculosis.

The RFLP technique involves cleaving the DNA with a restriction endonuclease which cleaves at an NciI-MspI restriction site to yield at least one DNA fragment and determining whether the number and location of the fragments is indicative of the absence of an NciI-MspI restriction site in codon 463 of said katG gene, wherein said absence is indicative of an INH resistant strain of M. tuberculosis, preferably by employing the techniques of gel electrophoresis.

If the amplified DNA of step (a) contains no NciI-MspI restriction sites, then the DNA fragment yielded in step (b) will be identical to the amplified DNA of step (a). This can occur where the portion of the katG gene amplified in step (a) is from an INH resistant strain of M. tuberculosis having a mutation in codon 463 that removes the NciI-MspI restriction site spanning that codon in the wild-type katG gene, and having no other additional NciI-MspI restriction sites.

In order for the amplified DNA to yield a meaningful RFLP pattern, the portion of the katG gene amplified in step (a) will be of sufficient length to produce fragments of sufficient length to visualize using gel electrophoresis. In the above-described embodiment, for example, the portion amplified will contain a sufficient number of bases to either side (5' or 3') of codon 463 such that cleavage at a site spanning that codon will yield fragments that can be visualized using gel electrophoresis.

In another embodiment of the invention directed to the additional identification of a mutation in codon 315 associated with INH resistance, the amplified DNA of step (a) further comprises at least one MspI restriction site and the nucleotide position occupied by base 1013 (FIG. 7, SEQ ID NO:20), and the determination made in step (b) further includes whether an MspI restriction site associated with codon 315 is present, wherein said presence is indicative of an INH resistant strain. For example, RFLP can also be employed to determine whether the number and location of the fragments is indicative of the codon 315 MspI restriction site. Preferably, the portion of the katG locus which is amplified is a minor portion of the entire katG gene, i.e., less than 1500 base pair, more preferably less than 1000 base pair, and is isolated and amplified by polymerase chain reaction, as described hereinbelow. The term "location" refers to the Rf (relative electrophoretic mobility) of a given fragment on the gel.

The pattern of fragments produced on a gel by electrophoresis of a restriction digest of an amplified portion of the katG gene of an M. tuberculosis strain of interest, such as an INH resistant strain, is preferably compared to the pattern produced in a digest of an equivalent portion of the katG gene of a wild-type (WT) control strain of M. tuberculosis, which strain is INH sensitive. The term "equivalent" is defined herein to mean that any two portions of the katG gene would comprise the same number and location of restriction sites being analyzed (e.g., sites recognized by CfoI, RsaI, MspI, and/or NciI) if the portions both were selected from a portion of the DNA of SEQ ID NO: 20 (i.e., if there were no mutations altering the number of restriction sites of the type being analyzed), and that the portions do not differ in size before cleavage to the extent that the number of fragments obtained cannot be compared following side-by-side gel electrophoresis and visualization of the resultant fragments, as described hereinbelow. For example, the control katG DNA can correspond to an equivalent portion of SEQ ID NO:20 (FIG. 7, upper sequence) comprising one or more of the codons of interest (e.g., codons 315 or 463) and their associated restriction sites. As discussed below, such a portion of DNA can be derived from strain H37Rv MC. A positive control corresponding to DNA fragments derived from a known INH resistant strain may also be used.

In the embodiment of the assay of the invention directed to the determination of the presence or absence of a NciI-MspI restriction site associated with codon 463, gel electrophoresis is employed to compare the number and location of the DNA fragments to the number and location of DNA fragments derived from cleavage of DNA derived from an equivalent portion of the katG gene wherein the NciI-MspI restriction site at codon 463 is determination of the absence of the restriction site at codon 463 in the katG gene is indicative of an INH resistant strain of M. tuberculosis. Preferably, the control DNA sequence of the portion of the katG gene wherein the codon 463 restriction site is present corresponds to a portion of SEQ ID NO:20 (FIG. 7, upper sequence). For example, the control DNA may contain five NciI-MspI restriction sites in each DNA molecule prior to cleavage, and the DNA of step (a), which is derived from an INH resistant strain, may contain four NciI-MspI restriction sites in each DNA molecule prior to cleavage. The assay also preferably includes positive control DNA fragments derived from an INH resistant strain which does not include the codon 463 NciI-MspI restriction site in the katG gene.

The present invention also provides oligonucleotides and subunits thereof useful in pairs as primers to initiate the polymerase chain reaction (PCR). Subunits of at least seven bases in length are preferred. PCR is useful both to amplify katG DNA so as to prepare both the target DNA of step (a) of the present process, as well as the DNA which is used to prepare the control digest.

The present invention also provides isolated, purified DNA represented by the consensus sequence derived for the M. tuberculosis katG gene. This DNA was found to occur in nature as the katG gene of M. tuberculosis strain H37Rv MC, as maintained at the Mayo Clinic, and is also referred to as the wild-type (WT) DNA. The present invention also includes isolated, purified DNA encoding the consensus amino acid sequence encoded by the consensus wild-type katG DNA, as well as DNA sequences that differ in sequence but which also encode this amino acid sequence (a consensus catalase peroxidase polypeptide) and can be employed to provide the isolated, purified polypeptide represented by the consensus amino acid sequence, which polypeptide is also provided by the invention.

The polypeptide of the invention can be prepared by expression in transformed host cells, such as bacteria, yeast, plant, or insect cells transformed with the DNA sequences of the present invention, operatively linked to regulatory regions functional in the transformed host cells. The polypeptide can be used as a standard M. tuberculosis catalase peroxidase, to correlate enzymatic activity (relative level, loss and restoration), with INH modification and degradation and drug resistance in M. tuberculosis.

The present invention also provides a kit comprising, separately packaged in association:

(a) a pair of oligonucleotide primers selected so as to amplify a portion of the DNA of the M. tuberculosis katG gene comprising base 1457 in codon 463 or base 1013 in codon 315, as depicted in FIG. 7 (SEQ ID NO:20); and

(b) an amount of a restriction endonuclease such as MspI, effective to cleave the amplified portion of said DNA at a restriction site comprising said base 1457 or said base 1013.

The present kits will also preferably comprise instruction means for carrying out the present assay, i.e., a printed package insert, tag or label, or an audio or video tape. The present kits will also preferably comprise a control DNA digest prepared by amplifying a portion of the consensus DNA of SEQ ID NO:20 (FIG. 7), that is equivalent to the portion defined and amplified by the pair of primers, followed by digestion of the DNA with a suitable restriction endonuclease such as MspI.

The present invention is exemplified by the use of NciI, MspI, CfoI, and RsaI digestions, with the use of MspI digestion being preferred; however, any restriction endonuclease having a restriction site spanning all or a portion of codon 463, codon 315, codon 337, or codon 264, which portion contains the site of the single base mutation associated with INH resistance as identified in Table 2, may be used, as desired. For example, the restriction endonucleases listed in Table 1 can be employed. Particularly preferred are restriction endonucleases having a restriction site that contains the position occupied by base 1457 in codon 463, or base 1013 in codon 315, as depicted in FIG. 7 (SEQ ID NO:20).

                  TABLE 1     ______________________________________     M. tuber-     culosis*     katG gene     Specificity             Restriction Site                             Restriction Enzyme     ______________________________________     Cuts 264-A             C/CGC           AciI.sup.a     (sensitive)             GC/NGC.sup.b    BsoFI, Fnu4HI, Bsp6I, BssFI,                             BssXI, Cac824I, CcoP215I,                             CcoP216I, FbrI, ItaI, Uur960I             R/GCGCY.sup.c   Bsp143II, HaeII, Bme14ZI,                             BsmHI, Bst1473II, Bst16I,                             Btu34II, HinHI, LpnI, NgoI             G/CGC           CfoI, HhaI, BcaI, CcoP95I,                             Csp1470I, FnuDIII, Hin6I,                             Hin7I, HinGUI, HinPII,                             IlinSII, IlinS2I, MnnIV, SciNI     Cuts 264-T             GACGCNNNNN/NNNNN                             HgaI     (resistant)             (SEQ ID NO: 22)     Cuts 337-Y             GT/AC           RsaI, AfaI, Asp16HI, Asp17HI,     (resistant)             Asp18HI, Asp29HI, CcoP73I,                             Csp6I, CviQI, CviRII     Cuts 337-C             GC/NGC          BsoFI, Fnu1HI, Bsp61, BssFI,     (resistant)             BssFI, BssXI, Cac824I,                             CcoP215I, CcoP216I, FbrI,                             ItaI, Uur960I             C/CGC           AciI.sup.a     Cuts 315-S             C/CGC           AciI.sup.a     (sensitive)             GC/NGC          BsoFI, Fnu4HI, Bsp6I, BssFI,                             BssXI, Cac824I, CcoP215I,                             CcoP216I, FbrI, ItaI, Uur960I             CMG/CKG.sup.d   MspAII, NspBII     Cuts 315-T             R/CCGGY.sup.e   BsrFI, Cfr10I, Bco118I,     (resistant)             Bse118I, Bsp21I, BssAI             C/CGG           MspI, Bsu1192I, BsuFI, FinII,                             HapII, Hin2I, Hin5I, HpaII,                             MniII, MnoI, MspI, Pde137I,                             Pme35I, SecII, SfaGUI,                             Sth134I, Uba1128I, Uba1141I,                             Uba1267I, Uba1338I,                             Uba1355I, Uba1439I     Cuts 463-R             CC/SGG.sup.f    NciI, BcnI, AhaI     (sensitive)             C/CGG           MspI, Bsu1192I, BsuFI, FinII,                             HapII, Hin2I, Hin5I, HpaII,                             MniII, MnoI, MspI, Pde137I,                             Pme35I, SecII, SfaGUI,                             Sth134I, Uba1128I, Uba1141I,                             Uba1267I, Uba1338I,                             Uba1355I, Uba1439I     Cuts 463-L             CAG/NNN/CTG     AlwNI     (resistant)             CC/WGG.sup.g    BstNI, BstOI, MvaI             /CCWGG          EcoRII     ______________________________________      .sup.a AciI cleaves the complementary strand of the katG gene;      .sup.b N = C or G or A or T;      .sup.c R = A or G;      .sup.d M = A or C, K = G or T;      .sup.e Y = C or T;      .sup.f S = C or G;      .sup.g W = A or T.

                                      TABLE 2.sup.a     __________________________________________________________________________     264-A (sensitive).sup.b             (SEQ ID NO: 8)                      847                        GTC GAA ACA GCG GCG CTG ATC GTC GGC                                               873     264-T (resistant)             (SEQ ID NO: 9)                        GTC GAA ACA GCG ACG CTG ATC GTC GGC     337-Y (sensitive).sup.c             (SEQ ID NO: 10)                     1066                        CTC GAG ATC CTG TACGGC TAC GAG TGG                                              1092     337-C (resistant)             (SEQ ID NO: 11)                        CTC GAG ATC CTG TGC GGC TAC GAG TGG     315-S (sensitive).sup.d             (SEQ ID NO: 12)                     1000                        GAC GCG ATC ACC AGC GGC ATC GAG GTC                                              1026     315-T (resistant)             (SEQ ID NO: 13)                        GAC GCG ATC ACC ACC GGC ATC GAG GTC     463-R (sensitive).sup.e             (SEQ ID NO: 14)                     1444                        AAG AGC CAG ATC CGG GCA TCG GGA TTG                                              1470     463-L (resistant)             (SEQ ID NO: 15)                        AAG AGC CAG ATC CTG GCA TCG GGA TTG     __________________________________________________________________________      .sup.a The underlined codons represent the sites where the indicated      single base mutations confer INH resistance. The bold bases indicate      restriction sites as follows: G/CGC for CfoI in 264A (sensitive); GT/AC      for RsaI in 337Y (sensitive); C/CGG for MspI in 315T (resistant) and 463R      (sensitive). For ease of reference, the partial sequences shown in this      table include the 12 bases to either side of the affected codon; the      numbering system is the same as used for the wildtype consensus sequence      in FIGS. 1 and 7. The full sequence of bases to either side of the      affected codon is shown in FIG. 7. In each of the sensitive/resistant      pairs shown in this table, the upper sequence is the consensus, wildtype      sequence (INHsensitive) and the lower sequence is the mutant      (INHresistant) sequence.      .sup.b codon 264 GCG = ala (A) ACG = thr (T)      .sup.c codon 337 TAC = tyr (Y) TGC = cys (C)      .sup.d codon 315 AGC = ser (S) ACC = thr (T)      .sup.e codon 463 CGG = arg (R) CTG = leu (L)

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1, panels A-D, depicts the consensus, wild-type DNA sequence of the M. tuberculosis katG gene as the upper of the pair of sequences (61-2295) (SEQ ID NO:1). This DNA sequence data has been submitted to Gen Bank and has been assigned accession number UO6262. The lower of the pair of sequences depicts nucleotide sequence 1970-4190 of the KpnI fragment bearing the katG gene as depicted in FIG. 6 of Institute Pasteur et al. (published PCT application WO 93/22454). This sequence (SEQ ID NO:2) has been deposited in the EMBL data library under accession number X68081 (Gen Bank X68081. gb₋₋ ba). Dots (.) above the sequence mark every tenth base. The upper sequence is in lower case in areas where variation in the sequence among isolates described hereinbelow and the consensus sequence was found. The arrow before position 70 and after position 2291 of the upper sequence indicate the coding sequence of the katG gene.

FIG. 2 depicts the katG amino acid consensus sequence derived from 15 strains of M. tuberculosis (SEQ ID NO:7).

FIG. 3 schematically depicts the NciI restriction sites for the part B amplicon of katG. The (*) depicts the site of the Arg→Leu mutation which is found in some INH resistant M. tuberculosis strains.

FIG. 4 depicts the results of a gel electrophoresis of the NciI digest of the part B amplicon of 14 strains of M. tuberculosis (1-14).

FIG. 5 schematically depicts MspI and RsaI restriction sites and resulting RFLP fragments for a portion of the M. tuberculosis katG gene. For MspI, restriction maps for wild-type (W+), single (315 Ser→Thr or 463 Arg→Leu) mutants and the double (315 Ser→Thr and 463 Arg→Leu) mutant are shown. For RsaI, restriction maps for wild-type (W+) and the 337 W→C mutant are shown.

FIG. 6 schematically depicts CfoI restriction sites and resulting RFLP fragments for a portion of the M. tuberculosis katG gene. Restriction maps for wild-type (W+) and the 264 A→T mutant are shown.

FIG. 7, panels A-C, depict as the upper of the pair of sequences the consensus, wild-type DNA sequence of the M. tuberculosis katG gene (SEQ ID NO:20), and as the lower of the pair of sequences the amino acid consensus sequence encoded thereby (SEQ ID NO:21). This information is updated from that presented in FIGS. 1 (SEQ ID NO:1) and 2 (SEQ ID NO:7) and the numbering system is as used therein. The amino acid and nucleotide sequences are arranged in this figure so as to facilitate convenient determination of which codons encode which amino acid in the polypeptide sequence.

FIG. 8 depicts the RFLP patterns produced by an MspI restriction digest of an amplified portion of the DNA of the katG genes of wild-type and mutant strains of M. tuberculosis, wherein the mutant DNA contains mutations at either codon 315 or codon 463, or both.

FIG. 9 depicts the RFLP patterns produced by an RsaI restriction digest of an amplified portion of the DNA of the katG genes of wild-type and mutant strains of M. tuberculosis, wherein the mutant DNA contain a mutation at codon 337.

FIG. 10 depicts the RFLP patterns produced by a CfoI restriction digest of an amplified portion of the DNA of the katG genes of wild-type and mutant strains of M. tuberculosis, wherein the mutant DNA contains a mutation codon 264.

DETAILED DESCRIPTION OF THE INVENTION

Wild type strains of M. tuberculosis are highly susceptible to isoniazid (INH) with minimum inhibitory INH concentration (MIC or IC_(min)) ≦0.02 μg/ml, and a susceptible strain is considered to be one with an IC_(min) <1.0 μg/ml. At the Mayo Clinic, Rochester, Minn., clinical strains of M. tuberculosis (including MDR-TB strains) were identified which exhibit intermediate to high level resistance to INH (IC_(min) range 1.0 to >32 μg/ml). Many of these strains, especially those highly resistant to INH (≧4.0 μg/ml), exhibited diminished catalase activity as assessed by a semiquantitative technique. The mean semiquantitative catalase was 16.5 mm for 6/15 strains with INH IC_(min) <1.0 μg/ml and 13.3 mm for 9/15 strains with IC_(min) ≧1.0 μg/ml.

To develop the present assay, it was first necessary to determine whether some M. tuberculosis strains have decreased INH sensitivity as a result of katG gene mutations. Therefore, the nucleic acid sequences of the katG genes for both INH sensitive (IC_(min) <1.0 μg/ml) and INH resistant (IC_(min) ≧1.0 μg/ml) M. tuberculosis strains were determined. From the DNA sequencing data generated, a katG consensus sequence was derived, and katG sequences from all 15 M. tuberculosis strains (INH sensitive and INH resistant) were compared to the consensus sequence to determine katG deviations.

Five of nine INH resistant strains (INH IC_(min) ≧1.0 μg/ml) had one or more missense mutations; one had a nonsense mutation; one had an 8 base pair deletion; and two had no mutations in the coding sequences. All of the five strains with missense mutations had a common G to T transversion at base 1457 in codon 463 (bases 1456-1458) causing replacement of arginine with leucine and loss of an NciI-MspI restriction site. Two of those having mutations at codon 463 also showed a G to C transversion at base 1013 in codon 315 (bases 1012-1014) causing replacement of serine with threonine. A third contained a G to A transversion at base 859 in codon 264 (bases 859-861) resulting in the replacement of alanine by threonine, and a fourth contained an A to G transversion at base 1079 in codon 337 (bases 1078-1080), causing tyrosine to be replaced by cysteine. The numbering system is shown in FIG. 1 (SEQ ID NO:1). The affected codons and portions of the DNA sequences on either side are shown for both INH sensitive and INH resistant strains in Table 2.

Six INH sensitive strains (INH IC_(min) <1.0 μg/ml) were also sequenced and found to have from none to 5 amino acid differences with the consensus sequence of all 15 strains, but none of the mutations affected codons 463, 315, 264, or 337 or their overlapping restriction sites. Restriction analysis of a total of 32 sensitive and 43 resistant strains revealed a common restriction fragment length polymorphism (RFLP) in nearly half (19) of the 43 of INH resistant strains, but only one of the INH sensitive strains. Specifically, 44% of the INH resistant had lost the NciI-MspI restriction site at the locus of codon 463 while only 1 of 32 sensitive strains had this restriction polymorphism.

Subsequently, the frequency of codon 463 (R→L) and codon 315 (S→T) mutations in 97 M. tuberculosis clinical isolates was determined. These isolates were obtained from patients treated at Mayo Clinic and samples referred from other health care institutions. Restriction fragment length polymorphism (RFLP) analysis using the MspI restriction enzyme, which cleaves at a site spanning the consensus codon 463 site and at a site comprising a portion of the mutant codon 315 site on the katG gene of M. tuberculosis, was performed on amplified DNA from 97 clinical isolates. Comparison of the resulting RFLP patterns and IC_(min) for isoniazid revealed that of the 90 INH-resistant strains, approximately 10% had both mutations, 20% had the 315 S→T mutation only, and 26% had the 463 R→L mutation only. Thus, 51 of the 90 resistant strains were identified by RFLP as having mutations at codons 463, 315 or both, resulting in a detection of over 50% of the resistant strains by this molecular method in a single experiment. Only one of the seven INH sensitive strains was found to have the 463 R→L mutation, and none of the INH sensitive strains had the 315 S→T mutation. Greater INH resistance (>4.0 μg, INH/ml) is associated with the 315 S→T mutation, but not if the 463 R→L mutation is also present.

These results indicate that two mutations, arginine→leucine in codon 463 and serine→leucine in codon 315 of the M. tuberculosis catalase-peroxidase (katG) gene occur in a significant fraction of INH resistant M. tuberculosis strains (INH IC_(min) ≧1.0 μg/ml). Furthermore, these single base mutations can be determined using a rapid relatively simple method, i.e., PCR amplification, digestion and monitoring for a loss of an NciI and/or an MspI restriction site at codon 463, and the addition of an MspI restriction site at codon 315, by RFLP, as described in detail hereinbelow. Other restriction endonucleases can be used to determine whether or not these single base mutations exist in a katG gene of interest, as long as the restriction site cleaved by the restriction endonucleases contains the affected base, such that the endonuclease cleaves the wild-type sequence but not the corresponding mutant sequence, or vice versa. Although in a preferred embodiment of the invention, the number and location of the fragments is determined by gel electrophoresis, the presence or absence in the digest of a fragment comprising the indicated restriction sites can be determined by other methods known to the art, including immunoassays (dot blots and reverse dot blots), DNA probes, microtiter well capture and the like.

The present invention will be further described by reference to the following detailed examples. The 58 clinical strains of Mycobacterium tuberculosis used in Examples 1 and 2 were obtained from the Mycobacteriology Laboratory at the Mayo Clinic, Rochester, Minn., and the 17 M. tuberculosis DNA preparations were obtained from the GWL Hansen's Disease Center, Louisiana State University, Baton Rouge, La. The strain designated H37Rv MC has been maintained at the Mayo Clinic for over 50 years, and therefore was isolated before INH became available as a treatment modality for tuberculosis (circa 1952). H37Rv was deposited in the American Type Tissue Collection, Rockville, Md. in 1937 by A. Karlson of the Mayo Clinic under the accession number ATCC 25618, and has been freely available to the scientific community since. An apparent variant of this strain is disclosed in PCT WO 93/22454 (SEQ ID NO:2, herein). The ATCC strains 27294 and 25618 were recovered from the same patient in 1905 and 1934, respectively. All clinical M. tuberculosis strains were confirmed as M. tuberculosis using routine identification techniques described by J. A. Washington, "Mycobacteria and Norcardia," in: Laboratory Procedures in Clinical Microbiology, 2d ed., Springer-Verlag, N.Y. (1985) at pages 379-417.

For the 15 M. tuberculosis strains for which complete katG DNA sequencing was performed, susceptibility testing was done at the Mayo Clinic using Middlebrook 7H11 agar (DiMed, Inc., St. Paul, Minn. 55113) and the 1% proportion method described in Manual of Clinical Microbiology, 5th ed., A. Balows et al., eds., Amer. Soc. Microbiol. (1991) at pages 1138-52. The same method was used at the Mayo Clinic to determine susceptibility for an additional 43 M. tuberculosis strains for which restriction fragment length polymorphisms (RFLP) were determined. Isoniazid concentrations tested using this method included: 0.12, 0.25, 1.0, 2, 4, 8, 16, 32 μg/ml for the 15 strains sequenced and 1.0 and 4.0 μg/ml for the remaining 45 strains. Isoniazid resistance was defined as a maximum inhibitory concentration (IC_(min))≧1.0 μg/ml. Susceptibility testing was performed elsewhere for an additional 17 M. tuberculosis strains for which DNA lysates were provided by Diana L. Williams, Baton Rouge, La. These strains were of diverse geographical origin. 10 of these 17 strains, originated from Japan. The remaining 7 M. tuberculosis INH resistant strains included multiple drug resistant strains from recent multiple drug resistant tuberculosis (MDR-TB) nosocomial epidemics in New York, N.Y. and Newark, N.J. All were INH resistant (IC_(min) ≧1.0 μg/ml), and had resistance to at least one other drug. For all strains provided by Williams, the 1% direct proportion method was used, but the concentration of INH tested, and the media used varied as to site.

To conduct a semiquantitative test of catalase activity, M. tuberculosis strains were propagated on Lowenstein-Jensen media deeps contained in 20×150 mm screw-capped tubes. One ml of a 30% hydrogen peroxide (EM, Science, Gibbstown, N.Y. 08027) and 10% Tween 80 (Aldrich Chemical Co., Milwaukee, Wis. 53233) solution mixed in a 1:1 ratio was applied to the surface of growth. After 5 minutes, the highest (mm) of the column of bubbles (O₂) generated was recorded.

EXAMPLE 1. DNA Isolation and Polymerase Chain Reaction.

A. DNA Isolation.

For M. tuberculosis strains obtained from Mayo Clinic samples, DNA was extracted from cells using phenol (Boehringer Mannheim, Indianapolis, Ind. 46250-0414) and TE (1.0M Tris HC1 pH 8.0, 0.1M EDTA, Sigma, St. Louis, Mo. 63778) in a ratio of 600 μl:400 μl and 0.1 mm zirconium beads (Biospec Products, Bartlesville, Okla. 74005). The mixture was processed in a mini-bead beater for 30 seconds and allowed to stand for an additional 15 minutes. Following a brief centrifugation to sediment the zinconium beads, DNA in the supernatant was extracted using the IsoQuick kit (MicroProbe Corp., Garden Grove, Calif. 92641).

B. PCR Using Primer Pairs A1-A4 and B1-B2.

The DNA sequence for katG (EMBL no. X6808124) employed to design primers is depicted in FIG. 1(A-D), lower strand. The PCR method of R. K. Saiki et al., Science, 239, 487 (1988) was used to amplify the katG gene (ca. 2220 base pairs) in two segments which were designated A and B. Genomic DNA preparations (2 μl) were used with primers A1 (5' TCGGACCATAACGGCTTCCTGTTGGACGAG 3') (SEQ ID NO:3) and A4 (5' AATCTGCTTCGCCGACGAGGTCGTGCTGAC 3') (SEQ ID NO:4) or B1 (5' CACCCCGACGAAATGGGACAACAGTTTCCT 3') (SEQ ID NO:5) and B2 (5' GGGTCTGACAAATCGCGCCGGGCAAACACC 3') (SEQ ID NO:6).

The PCR mixture (50 μl) contained 10 mM TRIS, pH 8.3, 50 mM KCl, 1.5 mM MgCl₂, 0.2 mM each of dATP, dTTP, dGTP, dCTP, 1μM of each primer pair, 10% glycerol, 1.25 units/50 μl AmpliTaq DNA polymerase (Perkin Elmer Cetus). The mixture was overlaid with mineral oil and subjected to 4 min at 95° C. followed by 50 cycles of 1 min at 94° C. and 2 min at 74° C. A 1495 base pair product from the first half of katG was generated from the A1-A4 primers and 1435 base pair product was generated with the B 1-B2 primer pair.

EXAMPLE 2. DNA Sequencing and Homology Analysis.

The polymerase chain reaction (PCR) products were prepared for sequencing using the Magic™ PCR Preps DNA Purification System (Promega Corp., Madison, Wis. 53711). The DNA sequences were determined in both directions using the Taq dye-deoxy terminator cycle sequencing kit and 373A DNA sequencer (Applied Biosystems, Foster City, Calif. 94404) using a series of internal sequencing primers which provided appropriate coverage of katG.

The sequence data were analyzed using version 7 of the Genetics Computer Group sequence analysis software, as disclosed by J. Devereux et al., Nucl. Acids Res., 12, 387 (1984). From the 15 M. tuberculosis DNA sequences, a consensus sequence was derived to which all M. tuberculosis strains were compared. This consensus sequence is depicted in FIG. 1 (A-D) (SEQ ID NO:1) as the upper strand, and is compared to the sequence for katG (EMBL no. X6808124), depicted as the lower strand. The two sequences have 98.6% identity, as determined by the GCG program BESTFIT. The DNA sequence data has been submitted to Gen Bank and can be referenced by the accession numbers UO6262 (H37Rv MC), UO6258 (ATCC 25618), UO6259 (ATCC 27294), UO6260 (G6108), UO6261 (H35827), UO6270 (L6627-92), UO6271 (L68372), UO6264 (L11150), UO6268 (L24204), UO6269 (L33308), UO6265 (L16980), UO6266 (L1781), UO6272 (TMC306), UO6263 (L10373), and UO6267 (L23261). An updated, more complete and accurate M. tuberculosis katG gene sequence is presented in FIG. 7 (A-C) (SEQ ID NO:20).

The DNA data was then translated, aligned for comparison and a consensus amino acid sequence was generated (FIG. 2) (SEQ ID NO:7). The consensus amino acid sequence (SEQ ID NO:21) generated from the DNA of SEQ ID NO:20 is also presented in FIG. 7.

In general, the overall sequence agreement between INH sensitive and resistant strains was very high; the only deviations are those shown in Table 3.

                                      TABLE 3     __________________________________________________________________________     Analysis of Catalase-Peroxidase (katG) Gene in M. tuberculosis     __________________________________________________________________________     Strains            INH            MIC.sup.A            (μg/ml)                     Amino Acid Codon.sup.b     Strain INH Catalase                     2   10  17  90  224 243     __________________________________________________________________________     H37Rv MC            <0.12                20     ATCC 25618            <0.12                12     ATCC 27294            0.12                28   P-S     S-N     Q-E A-S     G6108  <0.12                12     H35827 0.25                14     L6627-92            0.5 13     L68372 1   8     L11150 8   28     L24204 8   36     L33308 8   15     L16980 16  15     L1781  32  5     TMC 306            >32 5                W*.sup.c     L10373 >32 5        8 bpd.sup.d     L23261 >32 5                Consensus                     P       S   W   Q   A     __________________________________________________________________________            INH            MIC.sup.A            (μg/ml)                     Amino Acid (Codon).sup.b     Strain INH Catalase                     264                        315                           337                              424                                 463                                    505                                       550                                          609     __________________________________________________________________________     H37Rv MC            <0.12                20     ATCC 25618            <0.12                12     ATCC 27294            0.12                28                     A-D     G6108  <0.12                12                        M-I     H35827 0.25                14     L6627-92            0.5 13     L68372 1   8          Y-C   R-L     L11150 8   28     L24204 8   36      S-T      R-L     L33308 8   15     L16980 16  15      S-T      R-L     L1781  32  5    A-T         R-L     TMC 306            >32 5     L10373 >32 5             A-V      A-D     L23261 >32 5                R-L                                    W-R   M-I                Consensus                     A  S  Y  A  R  W  A  M     __________________________________________________________________________      .sup.A MIC denotes Maximum Inhibitory Concentration, INH denotes isoniazi      .sup.b A denotes alanine, C cysteine, D aspartic acid, E glutamic acid, F      phenylalanine, G glycine, I isoleucine, K lysine, L leucine, M methionine      N asparagine, P proline, Q glutamine, R arginine S serine, T threonine, V      valine, W tryptophan, Y tryosine, B bpd B base pair deletion      .sup.c TGG→TGA (W→stop codon)      .sup.d 8 base pair deletion corresponding to wild type coordinates 98-105      creates a new TAG stop codon beginning 11 bp from coordinate 97.

The data in Table 3 show that six strains, H37Rv MC, ATCC 25618, H35827, L6627-92, L11150, and L33308, are completely homologous to the consensus at the indicated sites. Four are INH sensitive (INH IC_(min) <1.0 μg/ml) and two are INH resistant (IC_(min) >1.0 μg/ml). All other strains listed in Table 3 had 1 to 5 differences with the consensus and there was no strong correlation between the number of differences and INH sensitivity.

In the group of INH resistant strains, the most frequent change observed was the conversion of arginine at codon 463 to leucine. This was detected in five of nine isolates examined. There was not a consistent correlation between the loss of catalase activity and INH resistance since strains L11150 and L24204 had high levels of enzymatic activity, yet were INH resistant. Moreover, several other INH resistant strains showed catalase activity near the mean activity (16.5 mm) of the sensitive strains. Two other isolates had lost the ability to make normal katG gene product due either to an eight bp deletion (L10373, semiquantitative catalase, 3mm) or a nonsense mutation (TMC 306, semiquantitative catalase 5 mm). It was not possible to determine if, or how, any of the deviations from the consensus reported in Table 3 affect catalase activity or cause INH resistance. However, the change at codon 463 is frequent enough that is indicative of resistance.

The DNA sequence analysis indicated that the codon 463 occurs in the context of an NciI-MspI restriction site (both enzymes recognize the same site). Thus, when in the wild type sequence depicted in FIG. 1 at bases 1455-1458, CCGGG, is changed to CCTGG, it is no longer recognized (or cleaved) by either of these enzymes. The 1435 bp amplicon produced from the half of katG gene containing codon 463 normally has five NciI-MspI restriction sites whereas the codon altered strains have only four sites, as shown in FIG. 3. The loss of the site in question causes a unique restriction fragment length polymorphism (RFLP), which can be readily adapted to assay for resistant strains, as described in Example 3, below.

EXAMPLE 3. RFLP Analysis: MspI-NciI site in Codon 463

For restriction fragment length polymorphism (RFLP) analysis, a 1435 base pair amplimer (produced using the B 1-B2 primers) representing the 3' half of the katG gene was generated using PCR and then digested with NciI or MspI (Sigma Chemical Co., St. Louis, Mo. 63178). The gene fragments were analyzed with agarose gel electrophoresis using 2% Metaphor agarose (FMC BioProducts, Richland, Me. 04811). The gel was stained with ethidium bromide and photographed. The investigator who performed all restriction digests and electrophoresis was blinded as to the INH IC_(min) results.

The results of this experiment are depicted in FIG. 4, wherein Lane 1 denotes strain H37Rv MC, IC_(min) =<0.12 μg/mL; (2) L6627-92, 0.5 μg/mL; (3) L68372, 1.0 μg/ml; (4) L16980, 16 μg/mL; (5) L39791, 16 μg/mL; (6) L1781, 32 μg/mL; (7) L9118, 4 μg/mL; (8) L11150, 8 μg/mL; (9) L24204, 8 μg/mL; (10) L68858, <0.12 μg/mL; (11) 1115A<0.12 μg/mL; (12) L23261, >32 μg/mL; (13) 1341, >32 μg/mL; (14) M10838, >32 μg/mL; (15) molecular weight standard: PCR markers (United States Biochemical Corp., Cleveland, Ohio 44122). The digests obtained from resistant strains can be readily visually detected and differentiated from digests from susceptible strains.

Subsequently, a total of 75 M. tuberculosis strains (including the 15 strains sequenced) were analyzed for their loss of the appropriate restriction site. Of these strains, 32 were INH sensitive and 43 were INH resistant. The data showed that 19 (44%) of the 43 resistant strains had lost the expected restriction site in codon 463. One of the 33 (2.9%) sensitive strains had lost this restriction sites as well. None of the six sensitive strains listed in Table 3 lost this site.

EXAMPLE 4. Determination of the Presence or Absence of Mutations at Codons 264, 315, 337 or 463 in the M. tuberculosis katG Gene

A. Materials.

Primer pairs used for polymerase chain reaction were katG904katG1523 (nucleotide sequences 5' AGC TCG TAT GGC ACC GGA AC 3' (SEQ ID NO:16) and 5' TTG ACC TCC CAC CCG ACT TG 3' (SEQ ID NO:17)) and katG633katG983 (nucleotide sequences 5' CGG TAA GCG GGA TCT GGA GA 3' (SEQ ID NO:18) and 5' CAT TTC GTC GGG GTG TTC GT 3' (SEQ ID NO:19)). Subunits thereof that hybridize to the amplified DNA under the conditions described hereinbelow may also be used.

Polyacrylamide was obtained from National Diagnostics, Tris Borate EDTA solution (6X, cat. no. T6400), magnesium chloride, and dithiothreitol (DTT) from Sigma Chemical Company (St. Louis, Mo.), TEMED (cat. no. 161-0800) and ethidium bromide (EtBr) from Biorad, ammonium persulfate from Intermountain Sci., and nucleotides (dATP, dGTP, dCTP and dTTP, 100mM solutions) from Boehringer Mannheim Biochemicals. dUTP was obtained from Pharmacia. AmpErase™ uracil-N-glycosylase (UNG) and AmpliTaq™ were obtained from Perkin Elmer.

Restriction endonucleases were obtained as follows: MspI from Sigma (cat. no. R-4506) 10 u/μl with blue palette buffer; RsaI from New England Biochemical (cat. no. 167S) 10 u/μl with NEB buffer 1; and CfoI from Promega (cat. no. R624) 10 u/μl with buffer B.

100 mM nucleotide concentrates obtained from Boehringer Mannheim Biochemicals were used to make the dNTP stock solution, which was 1.25 mM in each nucleotide. Specifically, 10 μl each of dATP, dGTP, dCTP, and dTTP concentrates were added to 760 μl water. dNTP(U) stock solution, also 1.25 mM in each nucleotide, was made from the same 100 mM dATP, dGTP and dCTP concentrates, and 100 mM dUTP concentrate from Pharmacia. Ten μl of each of the four concentrates was added to 760 μl water to make the stock solution.

10X PCR buffer consisted of 100 mM Tris, pH 8.3, 500 mM KCl, and 15 mM MgCl₂. PCR mix "A" consisted of 1X PCR buffer, 200 μM each dATP, dGTP, dCTP, and dUTP, 1 μM each katG904 (SEQ ID NO: 16) and katG1523 (SEQ ID NO: 17) primers, 10% glycerol, 10 units/m/AmpErase™UNG, and 0.025 units/μl AmpliTaq. PCR mix B consisted of 1X PCR buffer, 200 μM each dATP, dGTP, dCTP, and dTTP, 1 μM each katG904 (SEQ ID NO: 16) and katG1523 (SEQ ID NO: 17) primers, 10% glycerol, and 0.025 units/μl AmpliTaq. PCR mix "C" consisted of 1X PCR buffer, 200 μM each dATP, dGTP, dCTP, and dUTP, 1 μM each katG633 (SEQ ID NO: 18) and katG983 (SEQ ID NO: 19) primers, 10% glycerol, 10 units/ml AmpErase™UNG, and 0.025 units/μl AmpliTaq.

Gel loading solution (Blue Juice) was obtained from Sigma (cat. no. G-2526). Gels were photographed on a UV transilluminator (UVP) with Polaroid 667 black and white film (31/4×41/4 inch) through an orange filter.

DNA extracts (target DNA) were prepared as described in Example I(A).

B. MspI RFLP Analysis.

PCR was performed by adding 2 μl of DNA extract to 48 μl PCR mix "A." Each reaction was covered with 2 drops of mineral oil. Temperature was cycled (Perkin Elmer DNA Thermo Cycler model 480) for 1 cycle of (5'--37°; 5'--95°) and 40 cycles of (1'--94°; 0.5'--60°; 0.75'--72°) and a 72° soak. MspI(10 u/μl) was diluted 1:10 in 100 mM MgCl₂. The amplified DNA (base pairs 904 through 1523) of a wild-type katG gene contains 7 MspI restriction sites (FIG. 5); of the 8 fragments produced in an MspI restriction digest, 4 are of sufficient length to be visualized using gel electrophoresis (see FIG. 5 for a restriction map). Diluted MspI (1 μl ) was mixed with 9 μl of the PCR reaction mixture containing the amplified DNA. The digest was incubated at 37° C. for 2 hours, then heated to 65° C. for 10 minutes. Subsequently, 10 μl of the digest plus 4 μl blue juice was electrophoresed on 6% polyacrylamide for 0.4 hour at 200 V. The gel was stained in EtBr (0.5 mg/ml 1XTBE) for 5 minutes and photographed.

Results are shown in FIG. 8. Lanes C, D, F, G, H, K, L, N, and Q show the wild-type genotype at codons 315 (AGC) and 463 (CGG) evidenced by 4 restriction of sufficient length to be visualized using gel electrophoresis (228, 153, 137, and 65 base pairs, respectively, see FIG. 5). Lanes M and 0 show an RFLP indicating a mutation at codon 315 that adds a new MspI restriction site, causing the 153 base pair fragment to be shortened to 132 base pair and become difficult to resolve from the 137 base pair fragment. The resulting 3 fragment pattern (65, 132/137 and 228 base pair) is indicative of an INH resistant strain. Lanes E, I and P show an RFLP indicating a mutation at codon 315 that eliminates an MspI restriction site, evidenced by the 3 visible fragments produced by cleavage versus the 4 produced by the wild-type genotype. The resulting 3 fragment pattern (153, 202, and 228 base pair) is indicative of an INH resistant strain. Lanes B and J show an RFLP indicating mutations at both codon 315 and codon 463. The resulting gain and loss of MspI restriction sites produces a distinctive 3 fragment RFLP pattern (132, 202 and 228 base pair) indicative of an INH resistant strain (see FIG. 5 for a restriction map).

C. RsaI RFLP Analysis.

PCR was performed by adding 2 μl of DNA extract to 48 μl PCR mix "B." Each reaction was covered with 2 drops of mineral oil. Temperature was cycled (Perkin Elmer DNA Thermo Cycler model 480) for 1 cycle of 2 minutes at 94° and 40 cycles of (1'--94°; 0.5'--60°; 0.75'--72°) and a 4° soak. RsaI (10 u/μl) was diluted 1:20 in 100 mM MgCl₂ /100 mM dithiothreitol. The amplified DNA (bases 904 through 1523) of a wild-type katG gene contains 2 RsaI restriction sites. Diluted RsaI (2 μl) was placed on top of the PCR reaction mixture (on the oil) and centrifuged at about 12,000× g for 10 seconds to drop the RsaI enzyme into the mixture containing the amplified DNA. The resulting mixture was incubated overnight (15-20 hours) at 37°, after which 10 μl of the digest plus 1 μl blue juice was electrophoresed on 6% polyacrylamide for 0.4 hour at 200 V. The gel was stained in EtBr (0.5 mg/ml 1XTBE) for 5 minutes and photographed.

Results are shown in FIG. 9. Lanes A, B, D, E, and F show the wild-type genotype at codon 337 (TAC), evidenced by three restriction fragments produced by cleavage at two sites. Lane C shows an RFLP indicating a mutation at codon 337 that eliminates one of the RsaI restriction sites. The resulting two fragment pattern has been observed in an INH resistant strain.

D. CfoI RFLP Analysis.

PCR was performed by adding 2 μl of DNA extract to 48 μl PCR mix "C." Each reaction was covered with 2 drops of mineral oil. Temperature was cycled (Perkin Elmer DNA Thermo Cycler model 480) for 1 cycle of (5'--37°; 5'--95°) and 40 cycles of (1'--94°; 0.5'--60°; 0.75'--72°) and a 72° soak. The amplified DNA (bases 633 through 983) of a wild-type katG gene contains 3 CfoI restriction sites. CfoI (10 u/μl) was diluted 1:5 in 100 mM MgCl₂. Diluted CfoI (1 μl) was mixed with 9 μl of the PCR reaction mixture containing the amplified DNA. The digest was incubated at 37° for 2 hours, then heated to 65° C. for 10 minutes. Subsequently, 10 μl of the digest plus 4 μl blue juice was electrophoresed on 6% polyacrylamide for 0.4 hour at 200 V. The gel was stained in EtBr (0.5 mg/ml 1XTBE) for 5 minutes and photographed.

Results are shown in FIG. 10. Lanes A-C show the wild-type genotype at codon 264 (GCG), evidenced by 4 restriction fragments produced by cleavage at three sites. Lane E shows an RFLP indicating a mutation at codon 264 that eliminates one of the CfoI restriction sites. The resulting three fragment pattern has been observed in an INH resistant strain

All publications, patents and patent documents are incorporated by reference herein, as though individually incorporated by reference. The invention has been described with reference to various specific and preferred embodiments and techniques. However, it should be understood that many variations and modifications may be made while remaining within the spirit and scope of the invention.

    __________________________________________________________________________     SEQUENCE LISTING     (1) GENERAL INFORMATION:     (iii) NUMBER OF SEQUENCES: 22     (2) INFORMATION FOR SEQ ID NO:1:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 2235 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     AGGAATGCTGTGCCCGAGCAACACCCACCCATTACAGAAACCACCACCGGAGCCGCTAGC60     AACGGCTGTCCCGTCGTGGGTCATATGAAATACCCCGTCGAGGGCGGCGGAAACCAGGAC120     TGGTGGCCCAACCGGCTCAATCTGAAGGTACTGCACCAAAACCCGGCCGTCGCTGACCCG180     ATGGGTGCGGCGTTCGACTATGCCGCGGAGGTCGCGACCATCGACGTTGACGCCCTGACG240     CGGGACATCGAGGAAGTGATGACCACCTCGCAGCCGTGGTGGCCCGCCGACTACGGCCAC300     TACGGGCCGCTGTTTATCCGGATGGCGTGGCACGCTGCCGGCACCTACCGCATCCACGAC360     GGCCGCGGCGGCGCCGGGGGCGGCATGCAGCGGTTCGCGCCGCTTAACAGCTGGCCCGAC420     AACGCCAGCTTGGACAAGGCGCGCCGGCTGCTGTGGCCGGTCAAGAAGAAGTACGGCAAG480     AAGCTCTCATGGGCGGACCTGATTGTTTTCGCCGGCAACTGCGCGCTGGAATCGATGGGC540     TTCAAGACGTTCGGGTTCGGCTTCGGCCGGGTCGACCAGTGGGAGCCCGATGAGGTCTAT600     TGGGGCAAGGAAGCCACCTGGCTCGGCGATGAGCGTTACAGCGGTAAGCGGGATCTGGAG660     AACCCGCTGGCCGCGGTGCAGATGGGGCTGATCTACGTGAACCCGGAGGGGCCGAACGGC720     AACCCGGACCCCATGGCCGCGGCGGTCGACATTCGCGAGACGTTTCGGCGCATGGCCATG780     AACGACGTCGAAACAGCGGCGCTGATCGTCGGCGGTCACACTTTCGGTAAGACCCATGGC840     GCCGGCCCGGCCGATCTGGTCGGCCCCGAACCCGAGGCTGCTCCGCTGGAGCAGATGGGC900     TTGGGCTGGAAGAGCTCGTATGGCACCGGAACCGGTAAGGACGCGATCACCAGCGGCATC960     GAGGTCGTATGGACGAACACCCCGACGAAATGGGACAACAGTTTCCTCGAGATCCTGTAC1020     GGCTACGAGTGGGAGCTGACGAAGAGCCCTGCTGGCGCTTGGCAATACACCGCCAAGGAC1080     GGCGCCGGTGCCGGCACCATCCCGGACCCGTTCGGCGGGCCAGGGCGCTCCCCGACGATG1140     CTGGCCACTGACCTCTCGCTGCGGGTGGATCCGATCTATGAGCGGATCACGCGTCGCTGG1200     CTGGAACACCCCGAGGAATTGGCCGACGAGTTCGCCAAGGCCTGGTACAAGCTGATCCAC1260     CGAGACATGGGTCCCGTTGCGAGATACCTTGGGCCGCTGGTCCCCAAGCAGACCCTGCTG1320     TGGCAGGATCCGGTCCCTGCGGTCAGCCACGACCTCGTCGGCGAAGCCGAGATTGCCAGC1380     CTTAAGAGCCAGATCCGGGCATCGGGATTGACTGTCTCACAGCTAGTTTCGACCGCATGG1440     GCGGCGGCGTCGTCGTTCCGTGGTAGCGACAAGCGCGGCGGCGCCAACGGTGGTCGCATC1500     CGCCTGCAGCCACAAGTCGGGTGGGAGGTCAACGACCCCGACGGGGATCTGCGCAAGGTC1560     ATTCGCACCCTGGAAGAGATCCAGGAGTCATTCAACTCCGCGGCGCCGGGGAACATCAAA1620     GTGTCCTTCGCCGACCTCGTCGTGCTCGGTGGCTGTGCCGCCATAGAGAAAGCAGCAAAG1680     GCGGCTGGCCACAACATCACGGTGCCCTTCACCCCGGGCCGCACGGATGCGTCGCAGGAA1740     CAAACCGACGTGGAATCCTTTGCCGTGCTGGAGCCCAAGGCAGATGGCTTCCGAAACTAC1800     CTCGGAAAGGGCAACCCGTTGCCGGCCGAGTACATGCTGCTCGACAAGGCGAACCTGCTT1860     ACGCTCAGTGCCCCTGAGATGACGGTGCTGGTAGGTGGCCTGCGCGTCCTCGGGCAAACT1920     ACAAGCGCTTACCGCTGGGCGTGTTCACCGAGGCCTCCGAGTCACTGACCAACGACTTCT1980     TCGTGAACCTGCTCGACATGGGTATCACCTGGGAGCCCTCGCCAGCAGATGACGGGACCT2040     ACCAGGGCAAGGATGGCAGTGGCAAGGTGAAGTGGACCGGCAGCCGCGTGGACCTGGTCT2100     TCGGGTCCAACTCGGAGTTGCGGGCGCTTGTCGAGGTCTATGGCGCCGATGACGCGCAGC2160     CGAAGTTCGTGCAGGACTTCGTCGCTGCCTGGGACAAGGTGATGAACCTCGACAGGTTCG2220     ACGTGCGCTGATTCG2235     (2) INFORMATION FOR SEQ ID NO:2:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 2221 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     AGGAATGCTGTGCCCGAGCAACACCCACCCATTACAGAAACCACCACCGGAGCCGCTAGC60     AACGGCTGTCCCGTCGTGGGTCATATGAAATACCCCGTCGAGGGCGGCGGAAACCAGGAC120     TGGTGGCCCAACCGGCTCAATCTGAAGGTACTGCACCAAAACCCGGCCGTCGCTGACCCG180     ATGGGTGCGGCGTTCGACTATGCCGCGGAGGTCGCGACCAGTCGACTTGACGCCCTGACG240     CGGGACATCGAGGAAGTGATGACCACCTCGCAGCCGTGGTGGCCCGCCGACTACGGCCAC300     TACGGGCCGCTGTTTATCCGGATGGCGTGGCACGCTGCCGGCACCTACCGCATCCACGAC360     GGCCGCGGCGGCGCCGGGGGCGGCATGCAGCGGTTCGCGCCGCTTAACAGCTGGCCCGAC420     AACGCCAGCTTGGACAAGGCGCGCCGGCTGCTGTGGCCGGTCAAGAAGAAGTACGGCAAG480     AAGCTCTCATGGGCGGACCTGATTGTTTTCGCCGGCAACCGCTGCGCTCGGAATCGATGG540     GCTTCAAGACGTTCGGGTTCGGCTTCGGGCGTCGACCAGTGGGAGACCGATGAGGTCTAT600     TGGGGCAAGGAAGCCACCTGGCTCGGCGATGACGGTTACAGCGTAAGCGATCTGGAGAAC660     CCGCTGGCCGCGGTGCAGATGGGGCTGATCTACGTGAACCCGGAGGCGCCGAACGGCAAC720     CCGGACCCCATGGCCGCGGCGGTCGACATTCGCGAGACGTTTCGGCGCATGGCCATGAAC780     GACGTCGAAACAGCGGCGCTGATCGTCGGCGGTCACACTTTCGGTAAGACCCATGGCGCC840     GGCCCGGCCGATCTGGTCGGCCCCGAACCCGAGGCTGCTCCGCTGGAGCAGATGGGCTTG900     GGCTGGAAGAGCTCGTATGGCACCGGAACCGGTAAGGACGCGATCACCAGCGGCATCGAG960     GTCGTATGGACGAACACCCCGACGAAATGGGACAACAGTTTCCTCGAGATCCTGTACGGC1020     TACGAGTGGGAGCTGACGAAGAGCCCTGCTGGCGCTTGGCAATACACCGCCAAGGACGGC1080     GCCGGTGCCGGCACCATCCCGGACCCGTTCGGCGGGCCAGGGCGCTCCCCGACGATGCTG1140     GCCACTGACCTCTCGCTGCGGGTGGATCCGATCTATGAGCGGATCACGCGTCGCTGGCTG1200     GAACACCCCGAGGAATTGGCCGACGAGTTCCGCAAGGCCTGGTACAAGCTGATCCACCGA1260     GACATGGGTCCCGTTGCGAGATACCTTGGGCCGCTGGTCCCCAAGCAGACCCTGCTGTGG1320     CAGGATCCGGTCCCTGCGGTCAGCACGACCTCGTCGGCGAAGCAGATTGCCAGCCTTAAG1380     AGCCAGATCCGGGCATCGGGATTGACTGTCTCACAGCTAGTTTCGACCGCATGGGCGGCG1440     GCGTCGTCGTTCCGTGGTAGCGACAAGCGCGGCGGCGCCAACGGTGGTCGCATCCGCCTG1500     CAGCCACAAGTCGGGTGGGAGGTCAACGACCCCGACGGATCTGCGCAAGGTCATTCGCAC1560     CCTGAAGAGATCCAGGAGTCATTCACTCGGCGCGGGAACATCAAAGTGTCCTTCGCCGAC1620     CTCGTCGTGCTCGGTGGCTGTGCGCCACTAGAGAAAGCAGCAAAGGCGGCTGGCCACAAC1680     ATCACGGTGCCCTTCACCCCGGGCCCGCACGATGCGTCGCAGGAACAAACCGACGTGGAA1740     TCCTTTGCCGTGCTGGAGCCCAAGGCAGATGGCTTCCGAAACTACCTCGGAAAGGGCAAC1800     CGTTGCCGGCCGAGTACATCGCTGCTCGACAAGGCGAACCTGCTTACGCTCAGTGCCCCT1860     GAGATGACGGTGCTGGTAGGTGGCCTGCGCGTCCTCGGCGCAAACTACAAGCGCTTACCG1920     CTGGGCGTGTTCACCGAGGCCTCCGAGTCACTGACCAACGACTTCTTCGTGAACCTGCTC1980     GACATGGGTATCACCTGGGAGCCCTCGCCAGCAGATGACGGGACCTACCAGGGCAAGGAT2040     GGCAGTGGCAAGGTGAAGTGGACCGGCAGCCGCGTGGACCTGGTCTTCGGGTCCAACTCG2100     GAGTTGCGGGCGCTTGTCGAGGTCTATGCGCCGATGACGCGGCAGGCGAAGTTCGTGACA2160     GGATTCGTCGCTGCGTGGGACAAGGTGATGAACCTCGACAGGTTCGACGTGCGCTGATTC2220     G2221     (2) INFORMATION FOR SEQ ID NO:3:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 30 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     TCGGACCATAACGGCTTCCTGTTGGACGAG30     (2) INFORMATION FOR SEQ ID NO:4:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 30 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     AATCTGCTTCGCCGACGAGGTCGTGCTGAC30     (2) INFORMATION FOR SEQ ID NO:5:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 30 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     CACCCCGACGAAATGGGACAACAGTTTCCT30     (2) INFORMATION FOR SEQ ID NO:6:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 30 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     GGGTCTGACAAATCGCGCCGGGCAAACACC30     (2) INFORMATION FOR SEQ ID NO:7:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 740 amino acids     (B) TYPE: amino acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     ValProGluGlyHisProProIleThrGluThrThrThrGlyAlaAla     151015     SerAsnGlyCysProValValGlyHisMetLysTyrProValGluGly     202530     GlyGlyAsnGlnAspTrpTrpProAsnArgLeuAsnLeuLysValLeu     354045     HisGlnAsnProAlaValAlaAspProMetGlyAlaAlaPheAspTyr     505560     AlaAlaGluValAlaThrIleAspValAspAlaLeuThrArgAspIle     65707580     GluGluValMetThrThrSerGlnProTrpTrpProAlaAspTyrGly     859095     HisTyrGlyProLeuPheIleArgMetAlaTrpHisAlaAlaGlyThr     100105110     TyrArgIleHisAspGlyArgGlyGlyAlaGlyGlyGlyMetGlnArg     115120125     PheAlaProLeuAsnSerTrpProAspAsnAlaSerLeuAspLysAla     130135140     ArgArgLeuLeuTrpProValLysLysLysTyrGlyLysLysLeuSer     145150155160     TrpAlaAspLeuIleValPheAlaGlyAsnCysAlaLeuGluSerMet     165170175     GlyPheLysThrPheGlyPheGlyPheGlyArgValAspGlnTrpGlu     180185190     ProAspGluValTyrTrpGlyLysGluAlaThrTrpLeuGlyAspGlu     195200205     ArgTyrSerGlyLysArgAspLeuGluAsnProLeuAlaAlaValGln     210215220     MetGlyLeuIleTyrValAsnProGluGlyProAsnGlyAsnProAsp     225230235240     ProMetAlaAlaAlaValAspIleArgGluThrPheArgArgMetAla     245250255     MetAsnAspValGluThrAlaAlaLeuIleValGlyGlyHisThrPhe     260265270     GlyLysThrHisGlyAlaGlyProAlaAspLeuValGlyProGluPro     275280285     GluAlaAlaProLeuGluGlnMetGlyLeuGlyTrpLysSerSerTyr     290295300     GlyThrGlyThrGlyLysAspAlaIleThrSerGlyIleGluValVal     305310315320     TrpThrAsnThrProThrLysTrpAspAsnSerPheLeuGluIleLeu     325330335     TyrGlyTyrGluTrpGluLeuThrLysSerProAlaGlyAlaTrpGln     340345350     TyrThrAlaLysAspGlyAlaGlyAlaGlyThrIleProAspProPhe     355360365     GlyGlyProGlyArgSerProThrMetLeuAlaThrAspLeuSerLeu     370375380     ArgValAspProIleTyrGluArgIleThrArgArgTrpLeuGluHis     385390395400     ProGluGluLeuAlaAspGluPheAlaLysAlaTrpTyrLysLeuIle     405410415     HisArgAspMetGlyProValAlaArgTyrLeuGlyProLeuValPro     420425430     LysGlnThrLeuLeuTrpGlnAspProValProAlaValSerHisAsp     435440445     LeuValGlyGluAlaGluIleAlaSerLeuLysSerGlnIleArgAla     450455460     SerGlyLeuThrValSerGlnLeuValSerThrAlaTrpAlaAlaAla     465470475480     SerSerPheArgGlySerAspLysArgGlyGlyAlaAsnGlyGlyArg     485490495     IleArgLeuGlnProGlnValGlyTrpGluValAsnAspProAspGly     500505510     AspLeuArgLysValIleArgThrLeuGluGluIleGlnGluSerPhe     515520525     AsnSerAlaAlaProGlyAsnIleLysValSerPheAlaAspLeuVal     530535540     ValLeuGlyGlyCysAlaAlaIleGluLysAlaAlaLysAlaAlaGly     545550555560     HisAsnIleThrValProPheThrProGlyArgThrAspAlaSerGln     565570575     GluGlnThrAspValGluSerPheAlaValLeuGluProLysAlaAsp     580585590     GlyPheArgAsnTyrLeuGlyLysGlyAsnProLeuProAlaGluTyr     595600605     MetLeuLeuAspLysAlaAsnLeuLeuThrLeuSerAlaProGluMet     610615620     ThrValLeuValGlyGlyLeuArgValLeuGlyAlaAsnTyrLysArg     625630635640     LeuProLeuGlyValPheThrGluAlaSerGluSerLeuThrAsnAsp     645650655     PhePheValAsnLeuLeuAspMetGlyIleThrTrpGluProSerPro     660665670     AlaAspAspGlyThrTyrGlnGlyLysAspGlySerGlyLysValLys     675680685     TrpThrGlySerArgValAspLeuValPheGlySerAsnSerGluLeu     690695700     ArgAlaLeuValGluValTyrGlyAlaAspAspAlaGlnProLysPhe     705710715720     ValGlnAspPheValAlaAlaTrpAspLysValMetAsnLeuAspArg     725730735     PheAspValArg     740     (2) INFORMATION FOR SEQ ID NO:8:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 27 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     GTCGAAACAGCGGCGCTGATCGTCGGC27     (2) INFORMATION FOR SEQ ID NO:9:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 27 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:     GTCGAAACAGCGACGCTGATCGTCGGC27     (2) INFORMATION FOR SEQ ID NO:10:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 27 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:     CTCGAGATCCTGTACGGCTACGAGTGG27     (2) INFORMATION FOR SEQ ID NO:11:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 27 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:     CTCGAGATCCTGTGCGGCTACGAGTGG27     (2) INFORMATION FOR SEQ ID NO:12:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 27 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:     GACGCGATCACCAGCGGCATCGAGGTC27     (2) INFORMATION FOR SEQ ID NO:13:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 27 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:     GACGCGATCACCACCGGCATCGAGGTC27     (2) INFORMATION FOR SEQ ID NO:14:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 27 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:     AAGAGCCAGATCCGGGCATCGGGATTG27     (2) INFORMATION FOR SEQ ID NO:15:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 27 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:     AAGAGCCAGATCCTGGCATCGGGATTG27     (2) INFORMATION FOR SEQ ID NO:16:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:     AGCTCGTATGGCACCGGAAC20     (2) INFORMATION FOR SEQ ID NO:17:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:     TTGACCTCCCACCCGACTTG20     (2) INFORMATION FOR SEQ ID NO:18:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:     CGGTAAGCGGGATCTGGAGA20     (2) INFORMATION FOR SEQ ID NO:19:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 20 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:     CATTTCGTCGGGGTGTTCGT20     (2) INFORMATION FOR SEQ ID NO:20:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 2331 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (ix) FEATURE:     (A) NAME/KEY: CDS     (B) LOCATION: 70..2289     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:     CGATATCCGACACTTCGCGATCACATCCGTGATCACAGCCCGATAACACCAACTCCTGGA60     AGGAATGCTGTGCCCGAGCAACACCCACCCATTACAGAAACCACCACC108     ValProGluGlnHisProProIleThrGluThrThrThr     1510     GGAGCCGCTAGCAACGGCTGTCCCGTCGTGGGTCATATGAAATACCCC156     GlyAlaAlaSerAsnGlyCysProValValGlyHisMetLysTyrPro     152025     GTCGAGGGCGGCGGAAACCAGGACTGGTGGCCCAACCGGCTCAATCTG204     ValGluGlyGlyGlyAsnGlnAspTrpTrpProAsnArgLeuAsnLeu     30354045     AAGGTACTGCACCAAAACCCGGCCGTCGCTGACCCGATGGGTGCGGCG252     LysValLeuHisGlnAsnProAlaValAlaAspProMetGlyAlaAla     505560     TTCGACTATGCCGCGGAGGTCGCGACCATCGACGTTGACGCCCTGACG300     PheAspTyrAlaAlaGluValAlaThrIleAspValAspAlaLeuThr     657075     CGGGACATCGAGGAAGTGATGACCACCTCGCAGCCGTGGTGGCCCGCC348     ArgAspIleGluGluValMetThrThrSerGlnProTrpTrpProAla     808590     GACTACGGCCACTACGGGCCGCTGTTTATCCGGATGGCGTGGCACGCT396     AspTyrGlyHisTyrGlyProLeuPheIleArgMetAlaTrpHisAla     95100105     GCCGGCACCTACCGCATCCACGACGGCCGCGGCGGCGCCGGGGGCGGC444     AlaGlyThrTyrArgIleHisAspGlyArgGlyGlyAlaGlyGlyGly     110115120125     ATGCAGCGGTTCGCGCCGCTTAACAGCTGGCCCGACAACGCCAGCTTG492     MetGlnArgPheAlaProLeuAsnSerTrpProAspAsnAlaSerLeu     130135140     GACAAGGCGCGCCGGCTGCTGTGGCCGGTCAAGAAGAAGTACGGCAAG540     AspLysAlaArgArgLeuLeuTrpProValLysLysLysTyrGlyLys     145150155     AAGCTCTCATGGGCGGACCTGATTGTTTTCGCCGGCAACTGCGCGCTG588     LysLeuSerTrpAlaAspLeuIleValPheAlaGlyAsnCysAlaLeu     160165170     GAATCGATGGGCTTCAAGACGTTCGGGTTCGGCTTCGGCCGGGTCGAC636     GluSerMetGlyPheLysThrPheGlyPheGlyPheGlyArgValAsp     175180185     CAGTGGGAGCCCGATGAGGTCTATTGGGGCAAGGAAGCCACCTGGCTC684     GlnTrpGluProAspGluValTyrTrpGlyLysGluAlaThrTrpLeu     190195200205     GGCGATGAGCGTTACAGCGGTAAGCGGGATCTGGAGAACCCGCTGGCC732     GlyAspGluArgTyrSerGlyLysArgAspLeuGluAsnProLeuAla     210215220     GCGGTGCAGATGGGGCTGATCTACGTGAACCCGGAGGGGCCGAACGGC780     AlaValGlnMetGlyLeuIleTyrValAsnProGluGlyProAsnGly     225230235     AACCCGGACCCCATGGCCGCGGCGGTCGACATTCGCGAGACGTTTCGG828     AsnProAspProMetAlaAlaAlaValAspIleArgGluThrPheArg     240245250     CGCATGGCCATGAACGACGTCGAAACAGCGGCGCTGATCGTCGGCGGT876     ArgMetAlaMetAsnAspValGluThrAlaAlaLeuIleValGlyGly     255260265     CACACTTTCGGTAAGACCCATGGCGCCGGCCCGGCCGATCTGGTCGGC924     HisThrPheGlyLysThrHisGlyAlaGlyProAlaAspLeuValGly     270275280285     CCCGAACCCGAGGCTGCTCCGCTGGAGCAGATGGGCTTGGGCTGGAAG972     ProGluProGluAlaAlaProLeuGluGlnMetGlyLeuGlyTrpLys     290295300     AGCTCGTATGGCACCGGAACCGGTAAGGACGCGATCACCAGCGGCATC1020     SerSerTyrGlyThrGlyThrGlyLysAspAlaIleThrSerGlyIle     305310315     GAGGTCGTATGGACGAACACCCCGACGAAATGGGACAACAGTTTCCTC1068     GluValValTrpThrAsnThrProThrLysTrpAspAsnSerPheLeu     320325330     GAGATCCTGTACGGCTACGAGTGGGAGCTGACGAAGAGCCCTGCTGGC1116     GluIleLeuTyrGlyTyrGluTrpGluLeuThrLysSerProAlaGly     335340345     GCTTGGCAATACACCGCCAAGGACGGCGCCGGTGCCGGCACCATCCCG1164     AlaTrpGlnTyrThrAlaLysAspGlyAlaGlyAlaGlyThrIlePro     350355360365     GACCCGTTCGGCGGGCCAGGGCGCTCCCCGACGATGCTGGCCACTGAC1212     AspProPheGlyGlyProGlyArgSerProThrMetLeuAlaThrAsp     370375380     CTCTCGCTGCGGGTGGATCCGATCTATGAGCGGATCACGCGTCGCTGG1260     LeuSerLeuArgValAspProIleTyrGluArgIleThrArgArgTrp     385390395     CTGGAACACCCCGAGGAATTGGCCGACGAGTTCGCCAAGGCCTGGTAC1308     LeuGluHisProGluGluLeuAlaAspGluPheAlaLysAlaTrpTyr     400405410     AAGCTGATCCACCGAGACATGGGTCCCGTTGCGAGATACCTTGGGCCG1356     LysLeuIleHisArgAspMetGlyProValAlaArgTyrLeuGlyPro     415420425     CTGGTCCCCAAGCAGACCCTGCTGTGGCAGGATCCGGTCCCTGCGGTC1404     LeuValProLysGlnThrLeuLeuTrpGlnAspProValProAlaVal     430435440445     AGCCACGACCTCGTCGGCGAAGCCGAGATTGCCAGCCTTAAGAGCCAG1452     SerHisAspLeuValGlyGluAlaGluIleAlaSerLeuLysSerGln     450455460     ATCCGGGCATCGGGATTGACTGTCTCACAGCTAGTTTCGACCGCATGG1500     IleArgAlaSerGlyLeuThrValSerGlnLeuValSerThrAlaTrp     465470475     GCGGCGGCGTCGTCGTTCCGTGGTAGCGACAAGCGCGGCGGCGCCAAC1548     AlaAlaAlaSerSerPheArgGlySerAspLysArgGlyGlyAlaAsn     480485490     GGTGGTCGCATCCGCCTGCAGCCACAAGTCGGGTGGGAGGTCAACGAC1596     GlyGlyArgIleArgLeuGlnProGlnValGlyTrpGluValAsnAsp     495500505     CCCGACGGGGATCTGCGCAAGGTCATTCGCACCCTGGAAGAGATCCAG1644     ProAspGlyAspLeuArgLysValIleArgThrLeuGluGluIleGln     510515520525     GAGTCATTCAACTCCGCGGCGCCGGGGAACATCAAAGTGTCCTTCGCC1692     GluSerPheAsnSerAlaAlaProGlyAsnIleLysValSerPheAla     530535540     GACCTCGTCGTGCTCGGTGGCTGTGCCGCCATAGAGAAAGCAGCAAAG1740     AspLeuValValLeuGlyGlyCysAlaAlaIleGluLysAlaAlaLys     545550555     GCGGCTGGCCACAACATCACGGTGCCCTTCACCCCGGGCCGCACGGAT1788     AlaAlaGlyHisAsnIleThrValProPheThrProGlyArgThrAsp     560565570     GCGTCGCAGGAACAAACCGACGTGGAATCCTTTGCCGTGCTGGAGCCC1836     AlaSerGlnGluGlnThrAspValGluSerPheAlaValLeuGluPro     575580585     AAGGCAGATGGCTTCCGAAACTACCTCGGAAAGGGCAACCCGTTGCCG1884     LysAlaAspGlyPheArgAsnTyrLeuGlyLysGlyAsnProLeuPro     590595600605     GCCGAGTACATGCTGCTCGACAAGGCGAACCTGCTTACGCTCAGTGCC1932     AlaGluTyrMetLeuLeuAspLysAlaAsnLeuLeuThrLeuSerAla     610615620     CCTGAGATGACGGTGCTGGTAGGTGGCCTGCGCGTCCTCGGCGCAAAC1980     ProGluMetThrValLeuValGlyGlyLeuArgValLeuGlyAlaAsn     625630635     TACAAGCGCTTACCGCTGGGCGTGTTCACCGAGGCCTCCGAGTCACTG2028     TyrLysArgLeuProLeuGlyValPheThrGluAlaSerGluSerLeu     640645650     ACCAACGACTTCTTCGTGAACCTGCTCGACATGGGTATCACCTGGGAG2076     ThrAsnAspPhePheValAsnLeuLeuAspMetGlyIleThrTrpGlu     655660665     CCCTCGCCAGCAGATGACGGGACCTACCAGGGCAAGGATGGCAGTGGC2124     ProSerProAlaAspAspGlyThrTyrGlnGlyLysAspGlySerGly     670675680685     AAGGTGAAGTGGACCGGCAGCCGCGTGGACCTGGTCTTCGGGTCCAAC2172     LysValLysTrpThrGlySerArgValAspLeuValPheGlySerAsn     690695700     TCGGAGTTGCGGGCGCTTGTCGAGGTCTATGGCGCCGATGACGCGCAG2220     SerGluLeuArgAlaLeuValGluValTyrGlyAlaAspAspAlaGln     705710715     CCGAAGTTCGTGCAGGACTTCGTCGCTGCCTGGGACAAGGTGATGAAC2268     ProLysPheValGlnAspPheValAlaAlaTrpAspLysValMetAsn     720725730     CTCGACAGGTTCGACGTGCGCTGATTCGGGTTGATCGGCCCTGCCCGCCGA2319     LeuAspArgPheAspValArg     735740     TCAACCACAACC2331     (2) INFORMATION FOR SEQ ID NO:21:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 740 amino acids     (B) TYPE: amino acid     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: protein     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:     ValProGluGlnHisProProIleThrGluThrThrThrGlyAlaAla     151015     SerAsnGlyCysProValValGlyHisMetLysTyrProValGluGly     202530     GlyGlyAsnGlnAspTrpTrpProAsnArgLeuAsnLeuLysValLeu     354045     HisGlnAsnProAlaValAlaAspProMetGlyAlaAlaPheAspTyr     505560     AlaAlaGluValAlaThrIleAspValAspAlaLeuThrArgAspIle     65707580     GluGluValMetThrThrSerGlnProTrpTrpProAlaAspTyrGly     859095     HisTyrGlyProLeuPheIleArgMetAlaTrpHisAlaAlaGlyThr     100105110     TyrArgIleHisAspGlyArgGlyGlyAlaGlyGlyGlyMetGlnArg     115120125     PheAlaProLeuAsnSerTrpProAspAsnAlaSerLeuAspLysAla     130135140     ArgArgLeuLeuTrpProValLysLysLysTyrGlyLysLysLeuSer     145150155160     TrpAlaAspLeuIleValPheAlaGlyAsnCysAlaLeuGluSerMet     165170175     GlyPheLysThrPheGlyPheGlyPheGlyArgValAspGlnTrpGlu     180185190     ProAspGluValTyrTrpGlyLysGluAlaThrTrpLeuGlyAspGlu     195200205     ArgTyrSerGlyLysArgAspLeuGluAsnProLeuAlaAlaValGln     210215220     MetGlyLeuIleTyrValAsnProGluGlyProAsnGlyAsnProAsp     225230235240     ProMetAlaAlaAlaValAspIleArgGluThrPheArgArgMetAla     245250255     MetAsnAspValGluThrAlaAlaLeuIleValGlyGlyHisThrPhe     260265270     GlyLysThrHisGlyAlaGlyProAlaAspLeuValGlyProGluPro     275280285     GluAlaAlaProLeuGluGlnMetGlyLeuGlyTrpLysSerSerTyr     290295300     GlyThrGlyThrGlyLysAspAlaIleThrSerGlyIleGluValVal     305310315320     TrpThrAsnThrProThrLysTrpAspAsnSerPheLeuGluIleLeu     325330335     TyrGlyTyrGluTrpGluLeuThrLysSerProAlaGlyAlaTrpGln     340345350     TyrThrAlaLysAspGlyAlaGlyAlaGlyThrIleProAspProPhe     355360365     GlyGlyProGlyArgSerProThrMetLeuAlaThrAspLeuSerLeu     370375380     ArgValAspProIleTyrGluArgIleThrArgArgTrpLeuGluHis     385390395400     ProGluGluLeuAlaAspGluPheAlaLysAlaTrpTyrLysLeuIle     405410415     HisArgAspMetGlyProValAlaArgTyrLeuGlyProLeuValPro     420425430     LysGlnThrLeuLeuTrpGlnAspProValProAlaValSerHisAsp     435440445     LeuValGlyGluAlaGluIleAlaSerLeuLysSerGlnIleArgAla     450455460     SerGlyLeuThrValSerGlnLeuValSerThrAlaTrpAlaAlaAla     465470475480     SerSerPheArgGlySerAspLysArgGlyGlyAlaAsnGlyGlyArg     485490495     IleArgLeuGlnProGlnValGlyTrpGluValAsnAspProAspGly     500505510     AspLeuArgLysValIleArgThrLeuGluGluIleGlnGluSerPhe     515520525     AsnSerAlaAlaProGlyAsnIleLysValSerPheAlaAspLeuVal     530535540     ValLeuGlyGlyCysAlaAlaIleGluLysAlaAlaLysAlaAlaGly     545550555560     HisAsnIleThrValProPheThrProGlyArgThrAspAlaSerGln     565570575     GluGlnThrAspValGluSerPheAlaValLeuGluProLysAlaAsp     580585590     GlyPheArgAsnTyrLeuGlyLysGlyAsnProLeuProAlaGluTyr     595600605     MetLeuLeuAspLysAlaAsnLeuLeuThrLeuSerAlaProGluMet     610615620     ThrValLeuValGlyGlyLeuArgValLeuGlyAlaAsnTyrLysArg     625630635640     LeuProLeuGlyValPheThrGluAlaSerGluSerLeuThrAsnAsp     645650655     PhePheValAsnLeuLeuAspMetGlyIleThrTrpGluProSerPro     660665670     AlaAspAspGlyThrTyrGlnGlyLysAspGlySerGlyLysValLys     675680685     TrpThrGlySerArgValAspLeuValPheGlySerAsnSerGluLeu     690695700     ArgAlaLeuValGluValTyrGlyAlaAspAspAlaGlnProLysPhe     705710715720     ValGlnAspPheValAlaAlaTrpAspLysValMetAsnLeuAspArg     725730735     PheAspValArg     740     (2) INFORMATION FOR SEQ ID NO:22:     (i) SEQUENCE CHARACTERISTICS:     (A) LENGTH: 15 base pairs     (B) TYPE: nucleic acid     (C) STRANDEDNESS: single     (D) TOPOLOGY: linear     (ii) MOLECULE TYPE: DNA     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:     GACGCNNNNNNNNNN15     __________________________________________________________________________ 

What is claimed is:
 1. A method for determining the susceptibility of a strain of M. tuberculosis to isoniazid comprising determining whether or not the DNA of said strain has an NciI-MspI restriction site comprising the codon corresponding to codon 463 of the M. tuberculosis katG gene consensus sequence depicted in FIG. 7 (SEQ ID NO:20), wherein the absence of said restriction site is indicative of an INH resistant strain.
 2. The method of claim 1 which comprises the step of:(a) amplifying a portion of the katG gene of an M. tuberculosis isolate to yield a detectable amount of DNA comprising the nucleotide position occupied by base 1457 of the M. tuberculosis katG gene consensus sequence depicted in FIG. 7 (SEQ ID NO:20); (b) cleaving the amplified DAN with a restriction endonuclease which cleaves at an NciI-MspI restriction site to yield at least one DNA fragment; and (c) employing the technique of gel electrophoresis to determine whether the number and location of the DNA fragments is indicative of the absence of an NciI-MspI restriction site comprising codon 463 of said katG gene, wherein said absence is indicative of an INH resistant strain of M. tuberculosis in said isolate.
 3. The method of claim 2 wherein the amplified DNA comprises 4 NciI-MspI restriction sites prior to cleavage.
 4. The method of claim 1 wherein polymerase chain reaction (PCR) is employed to amplify DNA from the katG gene of the isolate of M. tuberculosis to be assayed.
 5. The method of claim 4 wherein said DNA is amplified employing two oligonucleotide primers of the sequence (5' CACCCCACGAAATGGGACAACAGTTTCCT 3') (SEQ ID NO:5) and (5' GGGTCTGACAAATCGCGCCGGGCAAACACC 3') (SEQ ID NO:6), or subunits thereof, in the PCR to yield a 1435 base pair subunit of the katG gene.
 6. A method for determining the susceptibility of a strain of M. tuberculosis to isoniazid comprising determining whether or not the DNA comprising the katG gene of said strain has a mutated base, which mutated base is a T at the nucleotide position occupied by base 1457 in codon 463 of the M. tuberculosis katG gene consensus sequence, which codon 463 is represented by bases 1456 through 1458 depicted in FIG. 7 (SEQ ID NO:20), or a C at the nucleotide position occupied by base 1013 in codon 315 of the M. tuberculosis katG gene consensus sequence, which codon 315 is represented by bases 1012 through 1014 depicted in FIG. 7 (SEQ ID NO:20), and wherein said mutated base is indicative of an INH resistant strain.
 7. The method of claim 6 comprising the steps of:(a) selecting a restriction endonuclease which cleaves at either a wild-type restriction site or a corresponding mutant restriction site present on DNA comprising an M. tuberculosis katG gene, but not at both of said wild-type and corresponding mutant sites, wherein(1) said wild-type restriction site comprises(i) a G at the nucleotide position occupied by base 1457 in codon 463 of the M. tuberculosis katG gene consensus sequence, which codon 463 is represented by bases 1456 through 1458 depicted in FIG. 7 (SEQ ID NO:20), or (ii) a G at the nucleotide position occupied by base 1013 in codon 315 of the M. tuberculosis katG gene consensus sequence, which codon 315 is represented by bases 1012 through 1014 depicted in FIG. 7 (SEQ ID NO:20), and wherein (2) said corresponding mutant restriction site comprises(i) a T at the nucleotide position occupied by base 1457 in codon 463 of the M. tuberculosis katG gene consensus sequence, which codon 463 is represented by bases 1456 through 1458 depicted in FIG. 7 (SEQ ID NO:20), or (ii) a C at the nucleotide position occupied by base 1013 in codon 315 of the M. tuberculosis katG gene consensus sequence, which codon 315 is represented by bases 1012 through 1014 depicted in FIG. 7 (SEQ 1D NO:20); (b) amplifying a portion of the katG gene of an M. tuberculosis isolate to yield a detectable amount of DNA comprising the nucleotide positions occupied by said wild-type restriction site or corresponding mutant restriction site, which site is cleaved by the selected restriction endonuclease; (c) cleaving the amplified DNA with the selected restriction endonuclease to yield at least one DNA fragment; and (d) employing the technique of gel electrophoresis to determine whether the number and location of the DNA fragments is indicative of the presence of said mutated base at the positions occupied by base 1457 or base 1013 of the M. tuberculosis katG gene consensus sequence depicted in FIG. 7 (SEQ ID NO:20), which presence of a mutated base is indicative of an isoniazid resistant strain of M. tuberculosis in said isolate.
 8. The method of claim 7, wherein the amplified DNA of the M. tuberculosis isolate in step (b) further comprises at least one additional restriction site, which restriction site is cleaved by the selected restriction endonuclease.
 9. The method of claim 7, further comprising(e) amplifying a portion of the wild-type M. tuberculosis katG gene equivalent to the portion amplified in step (b), which wild-type gene has the consensus sequence depicted in FIG. 7 (SEQ ID NO:20), to yield a detectable amount of wild-type DNA; (f) cleaving the amplified wild-type DNA with the selected restriction enzyme to yield at least one wild-type DNA fragment; (g) employing the technique of gel electrophoresis to separate the wild-type DNA fragments; (h) using restriction fragment length polymorphism (RFLP) analysis to compare the number and location of the DNA fragments of the M. tuberculosis isolate in step (d) to the number and location of the wild-type DNA fragments in step (g) to determine whether the amplified portion of the katG gene of the M. tuberculosis isolate comprises a different number of restriction sites cleaved by said selected restriction enzyme, said difference being indicative of an isoniazid resistant strain of M. tuberculosis in said isolate.
 10. A method for determining the susceptibility of a strain of M. tuberculosis to isoniazid comprising determining whether or not DNA comprising the katG gene of said strain has a restriction site selected from the group consisting of:(a) an NciI-MspI restriction site comprising the nucleotide position occupied by base 1457 in codon 463 of the M. tuberculosis katG gene consensus sequence, which codon 463 is represented by bases 1456 through 1458 depicted in FIG. 7 (SEQ ID NO:20); (b) an MspI restriction site comprising the nucleotide position occupied by base 1013 in codon 315 of the M. tuberculosis katG gene consensus sequence, which codon 315 is represented by bases 1012 through 1014 depicted in FIG. 7 (SEQ ID NO:20); and (c) a BstNI restriction site comprising the nucleotide position occupied by base 1457 in codon 463 of the M. tuberculosis katG gene consensus sequence, which codon 463 is represented by bases 1456 through 1458 depicted in FIG. 7 (SEQ ID NO:20);wherein each of the absence of an NciI-MspI restriction site associated with codon 463, the presence of an MspI restriction site associated with codon 315, and the presence of a BstNI restriction site associated with codon 463 is indicative of an isoniazid resistant strain of M. tuberculosis.
 11. A method for determining the susceptibility of a strain of M. tuberculosis to isoniazid comprising the steps of:(a) amplifying a portion of the katG gene of an M. tuberculosis isolate to yield a detectable amount of DNA comprising at least one MspI restriction site, and nucleotide positions occupied by bases 1013 and 1457 of the M. tuberculosis katG gene consensus sequence depicted in FIG. 7 (SEQ ID NO:20); (b) cleaving the amplified DNA with a restriction endonuclease at said restriction site to yield DNA fragments; and (c) employing the technique of gel electrophoresis to determine whether the number and location of the DNA fragments is indicative of(1) the presence on said portion of the katG gene of an MspI restriction site associated with codon 315, which associated MspI restriction site comprises the nucleotide position occupied by base 1013 of the M. tuberculosis katG gene consensus sequence depicted in FIG. 7 (SEQ ID NO:20), or (2) the absence on said portion of the katG gene of an MspI restriction site associated with codon 463, which associated MspI restriction site comprises base 1457 of the M. tuberculosis katG gene consensus sequence depicted in FIG. 7 (SEQ ID NO:20),wherein each of said presence of an MspI restriction site associated with codon 315 and said absence of an MspI restriction site associated with codon 463 is indicative of an isoniazid resistant strain of M. tuberculosis in said isolate.
 12. The method of claim 11, wherein the restriction endonuclease used to cleave the amplified DNA is MspI.
 13. The method of claim 11 wherein the amplified DNA comprises nucleotide bases 904 through 1523 depicted in FIG. 7 (SEQ ID NO:20).
 14. The method of claim 11 wherein polymerase chain reaction (PCR) is employed to amplify said portion of the katG gene of said isolate.
 15. The method of claim 14 wherein the PCR employs the oligonucleotide primer pair of AGCTCGTATGGCACCGGAAC (SEQ ID NO:16) and TTGACCTCCCACCCGACTTG (SEQ ID NO:17), or subunits thereof which hybridize to the DNA of SEQ ID NO:20.
 16. The method of claim 14 wherein the PCR employs the oligonucleotides AGCTCGTATGGCACCGGAAC (SEQ ID NO: 16), TTGACCTCCCACCCGACTTG (SEQ ID NO: 17), or subunits thereof which subunits are effective for the amplification of a region of DNA incorporating codon 463 of the M. tuberculosis katG gene (SEQ ID NO:20).
 17. An oligonucleotide selected from the group consisting of SEQ D NO:16, SEQ ID NO:17, SEQ D NO:18, SEQ ID NO:19, and subunits thereof of at least 7bases which subunits are effective for the amplification of a region of DNA incorporating codon 463 of the M. tuberculosis katG gene (SEQ ID NO:20).
 18. The oligonucleotide of claim 17 selected from the group consisting of SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18 and SEQ ID NO:19.
 19. An isolated DNA molecule consisting of an M. tuberculosis katG gene, or fragment thereof of at least 7 bases, wherein said gene or fragment thereof comprises SEQ ID NO:13 OR SEQ ID NO:15.
 20. A method for determining the susceptibility of a strain of M. tuberculosis to isoniazid comprising determining whether or not the DNA of said strain has an MspI restriction site comprising the codon corresponding to codon 315 of the M. tuberculosis katG gene consensus sequence depicted in FIG. 7 (SEQ ID NO:20), wherein the presence of said restriction site is indicative of an INH resistant strain. 